AN ELEMENTARY PROOF THAT SUBGROUPS OF FREE 
GROUPS ARE FREE 

BENJAMIN STEINBERG 

Abstract. We provide an elementary proof that subgroups of free groups are 
free via group actions. 



1. Introduction 

A group $F$ is free on a set $X$ if there is a function $\iota\colon X \to F$ such that given any
mapping $\rho\colon X \to G$ with $G$ a group, there is a unique homomorphism $\varphi\colon F \to G$
making the diagram

$$
\begin{array}{ccc}
X & \xrightarrow{\;\iota\;} & F \\
 & {\scriptstyle \rho}\searrow & \downarrow{\scriptstyle \varphi} \\
 & & G
\end{array}
\tag{1}
$$

commute. One can easily prove that $\iota$ is injective and the image of $\iota$ generates
$F$. One can then view elements of $F$ as words over the alphabet $X \cup X^{-1}$. One
proves that each word represents the same element of $F$ as a unique reduced word,
called its reduced form; a word is reduced if it has no factor of the form $xx^{-1}$ with
$x \in X \cup X^{-1}$. When convenient one identifies $F$ with the set of reduced words over
$X \cup X^{-1}$. See [1] for details.
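The passage from a word to its reduced form can be sketched in a few lines of Python. The encoding is an assumption made only for this illustration: a generator is a lower-case letter and its formal inverse is the corresponding upper-case letter, so `"xX"` stands for $xx^{-1}$. The function names are hypothetical.

```python
def inverse_letter(letter: str) -> str:
    """Formal inverse of a letter: 'x' <-> 'X' (assumed encoding of x^{-1})."""
    return letter.swapcase()


def reduce_word(word: str) -> str:
    """Cancel factors y y^{-1} until none remain, using a stack."""
    stack = []
    for letter in word:
        if stack and stack[-1] == inverse_letter(letter):
            stack.pop()          # the new letter cancels the previous one
        else:
            stack.append(letter)
    return "".join(stack)


def invert_word(word: str) -> str:
    """Inverse of a word: reverse it and invert every letter."""
    return "".join(inverse_letter(c) for c in reversed(word))


def multiply(u: str, v: str) -> str:
    """Product of two elements of F, represented by reduced words."""
    return reduce_word(u + v)
```

With this encoding, `multiply("xy", "Yx")` yields `"xx"`, and multiplying any word by `invert_word` of itself yields the empty word, which represents the identity.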

The classical Nielsen-Schreier theorem says that every subgroup of a free group 
is free. The original proofs were combinatorial in nature, and therefore not very 
appealing. Short conceptual proofs appeared later based on covering spaces of 
graphs and the Seifert-van Kampen theorem. However, this approach is beyond 
the scope of a first graduate algebra course. The purpose of this note is to give an 
elementary, yet conceptual, proof that subgroups of free groups are free. The idea 
is similar to an approach of the author and Ribes using wreath products [3], but we
simplify things here by pursuing the avenue of group actions instead. If $X$ is a nice
topological space, then the category of covering spaces of $X$ is equivalent to the
category of $\pi_1(X)$-sets, so it is clear that the topological proof should correspond
to a group actions proof. We assume nothing about free groups beyond what is in 
the previous paragraph. 

2. Group actions 

In this paper we shall work with right actions of groups. The symmetric group
on a set $A$ is written $S_A$. The identity of a group is denoted 1.

2.1. Tensor products. If $G$ is a group, then by a $G$-set, we shall always mean
a right $G$-set. Let $H$ be a subgroup of $G$ and $A$ an $H$-set. Then $H$ acts on the
right of $A \times G$ by $(a,g)h = (ah, h^{-1}g)$. The set of orbits of this action is denoted
$A \otimes_H G$, as it is the natural notion of a tensor product in this context, cf. [2]. The
orbit of $(a,g)$ is denoted $a \otimes g$. Observe that $A \otimes_H G$ is a right $G$-set via the action
$(a \otimes g)g' = a \otimes gg'$.

Let $T$ be a transversal to the set of right cosets $G/H$ (i.e., a set of coset
representatives) and denote by $\overline{g}$ the element of $T$ representing the coset $Hg$ for $g \in G$;
we shall always assume that $1 \in T$. Notice that the map $A \times G/H \to A \otimes_H G$
given by $(a, Hg) \mapsto a \otimes \overline{g}$ is a bijection. Indeed, $a \otimes g = a \otimes g(\overline{g})^{-1}\overline{g} = ag(\overline{g})^{-1} \otimes \overline{g}$
and so the map $a \otimes g \mapsto (ag(\overline{g})^{-1}, Hg)$ (which is easily checked to be well defined)
is the inverse bijection. We often identify $A \otimes_H G$ with $A \times G/H$ via this bijection.
The action of $G$ transfers via the bijection as:

$$(a, Hg)g' = (a\,\overline{g}g'(\overline{gg'})^{-1}, Hgg'). \tag{2}$$

Notice that if $h \in H$, then $(a, H)h = (ah, H)$ using that $\overline{1} = 1 = \overline{h}$. Thus $A \times \{H\}$
is an $H$-invariant subset, isomorphic to $A$ as an $H$-set via the projection to the first
coordinate. From now on we do not distinguish these isomorphic $H$-sets.
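Formula (2) can be checked by brute force on a small example. The following sketch makes several assumptions that are not in the text: $G = S_3$ encoded as tuples, $H$ the subgroup of order 2 generated by a transposition, $A = \{0, 1\}$ with the non-trivial element of $H$ swapping the two points, and the transversal obtained by taking the smallest tuple in each coset. All names are hypothetical.

```python
from itertools import permutations


def mul(p, q):
    """Product pq suited to right actions: apply p first, then q."""
    return tuple(q[i] for i in p)


def inv(p):
    """Inverse permutation."""
    r = [0] * len(p)
    for i, j in enumerate(p):
        r[j] = i
    return tuple(r)


E = (0, 1, 2)                      # identity of S_3
S = (1, 0, 2)                      # a transposition
G = list(permutations(range(3)))
H = [E, S]                         # subgroup of order 2


def act_H(a, h):
    """Right action of H on A = {0, 1}: S swaps the two points."""
    assert h in H
    return a if h == E else 1 - a


def coset(g):
    """The right coset Hg as a frozenset."""
    return frozenset(mul(h, g) for h in H)


def rep(g):
    """Chosen representative of Hg; min() picks E for the coset H."""
    return min(coset(g))


def act(point, g2):
    """Formula (2): (a, Hg) g' = (a * rep(g) g' rep(g g')^{-1}, H g g')."""
    a, c = point
    new = mul(min(c), g2)                  # rep(g) g'
    h = mul(new, inv(rep(new)))            # lies in H
    return (act_H(a, h), coset(new))


points = [(a, c) for a in (0, 1) for c in {coset(g) for g in G}]

is_action = all(act(p, E) == p for p in points) and all(
    act(act(p, g1), g2) == act(p, mul(g1, g2))
    for p in points for g1 in G for g2 in G
)
restricts_to_A = all(
    act((a, coset(E)), h) == (act_H(a, h), coset(E)) for a in (0, 1) for h in H
)
```

Here `is_action` tests the two axioms of a right action on all six points of $A \times G/H$, and `restricts_to_A` tests that $A \times \{H\}$ carries the original $H$-action, as observed after (2).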

2.2. A characterization of free groups. It is well known that two groups $G$ and
$H$ are isomorphic if and only if the category of $G$-sets is equivalent to the category
of $H$-sets. Thus the following group action characterization of free groups should
come as no surprise.

Proposition 1. Let $X$ be a set and $F$ a group equipped with a map $\iota\colon X \to F$.
Then $F$ is a free group on $X$ (with respect to the mapping $\iota$) if and only if given
any set $A$ and any map $\sigma\colon X \to S_A$, there is a unique action of $F$ on $A$ such that

$$a\,\iota(x) = \sigma(x)(a)$$

for all $x \in X$ and $a \in A$.

Proof. Clearly if $F$ is free on $X$, then it has the property described in the
proposition. For the converse, let $\rho\colon X \to G$ be a mapping with $G$ a group. We need
to construct a unique homomorphism $\varphi\colon F \to G$ such that the diagram (1)
commutes. Define $\sigma\colon X \to S_G$ by $\sigma(x)(g) = g\rho(x)$. Then there is a unique action of
$F$ on $G$ such that $g \cdot \iota(x) = g\rho(x)$ for all $g \in G$, $x \in X$ by hypothesis, where we use $\cdot$
to distinguish the action from multiplication in $G$. We claim that $h(g \cdot w) = (hg) \cdot w$
for all $h, g \in G$ and $w \in F$. Indeed, fix $h \in G$. It is immediately verified that the
formula $g \odot w = h^{-1}[(hg) \cdot w]$ provides an action of $F$ on $G$. Moreover,

$$g \odot \iota(x) = h^{-1}[(hg) \cdot \iota(x)] = h^{-1}[hg\rho(x)] = g\rho(x).$$

Uniqueness now implies that $g \odot w = g \cdot w$ for all $w \in F$. In other words, we have
$h^{-1}[(hg) \cdot w] = g \cdot w$, or equivalently $(hg) \cdot w = h(g \cdot w)$, for all $h, g \in G$ and $w \in F$.

Define $\varphi\colon F \to G$ by $\varphi(w) = 1 \cdot w$. Then, for $x \in X$, one has $\varphi\iota(x) = 1 \cdot \iota(x) =
1\rho(x) = \rho(x)$ by construction. Furthermore, by the claim

$$\varphi(v)\varphi(w) = \varphi(v)(1 \cdot w) = (\varphi(v)1) \cdot w = (1 \cdot v) \cdot w = 1 \cdot (vw) = \varphi(vw)$$

and hence $\varphi$ is a homomorphism such that (1) commutes. It remains to verify that
$\varphi$ is unique. Suppose that $\tau\colon F \to G$ is another such homomorphism. Define an
action $*$ of $F$ on $G$ by $g * w = g\tau(w)$. Then $g * \iota(x) = g\tau\iota(x) = g\rho(x)$ and so $*$
coincides with $\cdot$ by uniqueness. Thus $\tau(w) = 1 * w = 1 \cdot w = \varphi(w)$, as required. □
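Proposition 1 says, informally, that an action of a free group is the same thing as a choice of one permutation per generator. A minimal Python sketch of this, under assumptions made only for illustration: $A = \{0, 1, 2\}$, permutations are tuples `p` with `p[a]` the image of `a`, generators are lower-case letters and upper-case letters stand for their inverses. The map `sigma` and the function names are hypothetical.

```python
def invert_perm(p):
    """Inverse of a permutation given as a tuple of images."""
    q = [0] * len(p)
    for i, j in enumerate(p):
        q[j] = i
    return tuple(q)


def act(a, word, sigma):
    """Right action of a word on the point a, one letter at a time.

    A lower-case letter x acts by sigma[x]; the upper-case letter X
    (assumed to encode x^{-1}) acts by the inverse permutation.
    """
    for letter in word:
        if letter.islower():
            a = sigma[letter][a]
        else:
            a = invert_perm(sigma[letter.lower()])[a]
    return a


# One permutation of A = {0, 1, 2} per generator (an arbitrary choice).
sigma = {"x": (1, 2, 0), "y": (1, 0, 2)}
```

Since each letter acts by a bijection and inverse letters act by inverse bijections, a word and its reduced form act in the same way; for instance `act(a, "xyYx", sigma)` agrees with `act(a, "xx", sigma)` for every point `a`.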

3. The Nielsen-Schreier Theorem

We now present an elementary proof that subgroups of free groups are free via 
group actions. Let $F$ be a free group on $X$ and $H$ a subgroup.

3.1. Schreier Transversals. A Schreier transversal for $H \le F$ is a transversal $T$
of H in F such that if we view T as a set of reduced words, then T is closed under 
taking prefixes (and hence in particular contains the empty word). The existence of 
Schreier transversals is a straightforward application of Zorn's Lemma. We include 
a proof for completeness. 

Lemma 2. There exists a Schreier transversal T of H in F. 

Proof. Consider the collection $\mathcal{P}$ of all prefix-closed sets of reduced words over
$X \cup X^{-1}$ that intersect each right coset of $H$ in at most one element and order $\mathcal{P}$
by inclusion. Then $\{1\} \in \mathcal{P}$ so it is non-empty. It is also clear that the union
of a chain of elements from $\mathcal{P}$ is again in $\mathcal{P}$, so $\mathcal{P}$ has a maximal element $T$ by
Zorn's Lemma. We need to show that each right coset of $H$ has a representative
in $T$. Suppose this is not the case and let $w$ be a minimum length word such that
$Hw \cap T = \emptyset$. Since $1 \in T$, it follows $w \neq 1$ and hence $w = ux$ in reduced form
where $x \in X \cup X^{-1}$. By assumption on $w$, we have $Hu = Ht$ for some $t \in T$ (and
so $Htx = Hw$). If $tx$ is reduced as written, then $T \cup \{tx\} \in \mathcal{P}$, contradicting the
maximality of $T$. If $tx$ is not reduced as written, then $tx$, or rather its reduced form,
belongs to $T$ (by closure of $T$ under prefixes) and $Hw = Htx$. This contradicts the
choice of $w$, completing the proof that $T$ is a transversal. □
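When $H$ has finite index, a Schreier transversal can be found without Zorn's Lemma by a breadth-first search. The sketch below is not the argument of the lemma but a finite substitute for it, under stated assumptions: $H$ is given as the stabilizer of the point `base` for a transitive action of $F$ on $\{0, \dots, m-1\}$ (so right cosets of $H$ correspond to points), permutations are tuples of images, and upper-case letters encode inverse generators. The function names are hypothetical.

```python
from collections import deque


def invert_perm(p):
    """Inverse of a permutation given as a tuple of images."""
    q = [0] * len(p)
    for i, j in enumerate(p):
        q[j] = i
    return tuple(q)


def schreier_transversal(sigma, base=0):
    """Map each point (= right coset of H) to a word carrying base to it.

    Each new word is a previously found word followed by one letter,
    so the resulting set of words is prefix-closed.
    """
    moves = {}
    for x, p in sigma.items():
        moves[x] = p                       # the generator x
        moves[x.upper()] = invert_perm(p)  # its inverse, written X
    word_of = {base: ""}
    queue = deque([base])
    while queue:
        point = queue.popleft()
        for letter, p in moves.items():
            nxt = p[point]
            if nxt not in word_of:         # first visit: record the word
                word_of[nxt] = word_of[point] + letter
                queue.append(nxt)
    return word_of


# F free on {x, y} acting on {0, 1, 2}; H is the stabilizer of 0, of index 3.
sigma = {"x": (1, 2, 0), "y": (1, 0, 2)}
transversal = schreier_transversal(sigma)
```

The recorded words are automatically reduced: appending the inverse of the last letter of a word leads back to a point that was already visited, so such a word is never recorded.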

3.2. The Nielsen-Schreier Theorem. We now proceed to prove that subgroups
of free groups are free using the tensor product construction.

Theorem 3 (Nielsen-Schreier). Subgroups of free groups are free. More precisely, 
let F be a free group on X and let H be a subgroup. Fix a Schreier transversal T 
for H and put 

$$B = \{\, tx(\overline{tx})^{-1} \mid (t,x) \in T \times X,\ tx(\overline{tx})^{-1} \neq 1 \,\}.$$
Then H is freely generated by B. 

Proof. By Proposition 1 it suffices to show that given a map $\sigma\colon B \to S_A$, there
is a unique action of $H$ on $A$ such that $ab = \sigma(b)(a)$. For convenience, we extend $\sigma$
to $B \cup \{1\}$ by mapping 1 to the identity of $S_A$.

First we prove uniqueness, as this will motivate the definition. So assume we
have such an action and consider the tensor product $A \otimes_H F$. As usual we identify
$A \otimes_H F$ with $A \times F/H$ where the action is given by $(a, Hv)w = (a\,\overline{v}w(\overline{vw})^{-1}, Hvw)$.
Our original action is the restriction of the action of $H$ on $A \times F/H$ to the subset
$A \times \{H\}$ (under the usual identifications) and hence is uniquely determined by the
action of $F$ on $A \times F/H$, which in turn is uniquely determined by the action of the
generators $X$ of $F$. But if $x \in X$, then

$$(a, Hw)x = (a\,\overline{w}x(\overline{wx})^{-1}, Hwx) = (\sigma(\overline{w}x(\overline{wx})^{-1})(a), Hwx)$$

(since $\overline{w}x(\overline{wx})^{-1} \in B \cup \{1\}$) and hence is uniquely determined by $\sigma$.

Let us now take $(a, Hw)x = (\sigma(\overline{w}x(\overline{wx})^{-1})(a), Hwx)$ as the definition of an
action of $F$ on $A \times F/H$ (each $x \in X$ acts by a permutation of $A \times F/H$, so this
determines an action of $F$ because $F$ is free on $X$); note that the action of $F$ in the
second coordinate is the usual action of $F$ on $F/H$ and so $A \times \{H\}$ is invariant under $H$.
We must show that $(a, H)b = (\sigma(b)(a), H)$ for $b \in B$.

Claim. Suppose that $t \in T$. Then for all $a \in A$, one has $(a, H)t = (a, Ht)$.

Proof of claim. We prove the claim by induction on the length of $t$ (as a reduced
word). If $t$ is empty, then trivially the claim holds. Suppose first that $t = ux$
with $x \in X$ as a reduced word. By definition of a Schreier transversal, we have
$u \in T$ and so by induction $(a, H)ux = (a, Hu)x = (\sigma(\overline{u}x(\overline{ux})^{-1})(a), Hux)$. But
$\overline{u}x = t = \overline{ux}$ and so the right hand side is $(a, Hux)$ as required.

Next suppose that $t = ux^{-1}$ with $x \in X$ (as a reduced word). By definition
of a Schreier transversal, $u \in T$ and so $\overline{t}x = u = \overline{tx}$. We need to verify that
$(a, Ht) = (a, H)t$, or equivalently, that $(a, Ht)x = (a, H)u$. But $(a, H)u = (a, Hu)$
by induction. On the other hand, $(a, Ht)x = (\sigma(\overline{t}x(\overline{tx})^{-1})(a), Htx) = (a, Hu)$
establishing the claim. □

To complete the proof, we must show that if $t \in T$ and $x \in X$ with $tx(\overline{tx})^{-1} \neq 1$,
then $(a, H)tx(\overline{tx})^{-1} = (\sigma(tx(\overline{tx})^{-1})(a), H)$; or equivalently, we must show that

$$(a, H)tx = (\sigma(tx(\overline{tx})^{-1})(a), H)\,\overline{tx}. \tag{3}$$

By the claim, the right hand side of (3) is $(\sigma(tx(\overline{tx})^{-1})(a), Htx)$, whereas the left
hand side is $(a, Ht)x = (\sigma(tx(\overline{tx})^{-1})(a), Htx)$. This completes the proof that $H$ is
free on the set $B$. □

It is an easy combinatorial exercise to verify that the elements of $B$ are distinct
(the above proof does not provide this) and to count that if the size of $X$ is $n$ and
$[F : H] = m$, then $B$ has $1 + m(n-1)$ elements (this is Schreier's formula). For the
last statement one just observes that $T \times X$ has $mn$ elements and that each
non-empty word $t \in T$ accounts for exactly one pair $(s, x) \in T \times X$ with
$sx(\overline{sx})^{-1} = 1$: the pair $(u, x)$ if $t = ux$, and the pair $(t, x)$ if $t = ux^{-1}$, in reduced
form with $x \in X$. Hence exactly $m - 1$ pairs are discarded, leaving $mn - (m-1) = 1 + m(n-1)$.
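The generating set $B$ and Schreier's formula can be checked on a toy example. The sketch below assumes the following data, none of which comes from the text: $F$ is free on $\{x, y\}$ (so $n = 2$) and acts on $\{0, 1, 2\}$ by the permutations in `sigma`, $H$ is the stabilizer of the point 0 (so $m = 3$), upper-case letters encode inverse generators, and the Schreier transversal $\{1, x, x^{-1}\}$ is written down by hand. All names are hypothetical.

```python
def reduce_word(word):
    """Free reduction: cancel adjacent pairs such as 'xX' or 'Xx'."""
    stack = []
    for letter in word:
        if stack and stack[-1] == letter.swapcase():
            stack.pop()
        else:
            stack.append(letter)
    return "".join(stack)


def invert_word(word):
    """Inverse of a word: reverse it and invert every letter."""
    return "".join(c.swapcase() for c in reversed(word))


# Action of the generators on the points 0, 1, 2 (tuples of images).
sigma = {"x": (1, 2, 0), "y": (1, 0, 2)}

# Assumed Schreier transversal: point p -> the word t in T with 0 * t = p.
word_of = {0: "", 1: "x", 2: "X"}


def schreier_generators(sigma, word_of):
    """All non-trivial elements t x rep(t x)^{-1} with t in T, x in X."""
    gens = set()
    for point, t in word_of.items():
        for x, perm in sigma.items():
            target = perm[point]           # the coset of t x, as a point
            b = reduce_word(t + x + invert_word(word_of[target]))
            if b:                          # discard the trivial element
                gens.add(b)
    return gens


B = schreier_generators(sigma, word_of)
n, m = len(sigma), len(word_of)
```

In this example `B` consists of the four words `"yX"`, `"xxx"`, `"xy"` and `"Xyx"`, in agreement with $1 + m(n-1) = 1 + 3 \cdot 1 = 4$.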